Abstract We obtain several optimal inequalities on tail probabilities for sums of independent 
and bounded random variables. The results complete an upper bound on tail probabilities due 
to Talagrand by giving a sharp lower bound and some explicit expressions for the constants 
of Talagrand's bound. In particular, we obtain a one-term asymptotic expansion for large 
deviations, which can be regarded as an inequality of Cramér and Bahadur-Rao type. The 
asymptotic expansion shows that the Talagrand inequality is sharp. 

1 Introduction 

Let $\xi_1, \ldots, \xi_n$ be a sequence of random variables (r.v.s) satisfying $\mathbf{E}\xi_i = 0$. Denote $X_n = \sum_{i=1}^n \xi_i$. 
The study of tail probabilities $\mathbf{P}(X_n > x)$ has a long history. Interesting asymptotic expansions 
were obtained by Cramér (1938), Bahadur and Rao (1960), Petrov (1975, 1995), Saulis and 
Statulevičius (1991), Rozovsky (2005), Bercu et al. (2006, 2011), Petrov and Robinson (2008) 
and Györfi, Harremoës and Tusnády (2012). Various upper bounds have been obtained: see 
Nagaev (1979), Pinelis and Utev (1989), Talagrand (1995, 1996a), Ledoux and Talagrand (1996) 
and Nagaev (2002) in the independent case; see Talagrand (1996b), Rio (2002, 2012) and van 
de Geer and Lederer (2011) for empirical processes; see Haeusler (1984), van de Geer (1995, 
2002), Nagaev (2003, 2007), Bercu and Touati (2008) and Fan et al. (2012) for martingales; see 
Doukhan and Neumann (2007) and Delyon (2009) for sums of weakly dependent r.v.s. For a few 
results on lower bounds, we refer to Nagaev (2002) and Rozovsky (2003) for sums of independent 
r.v.s. 

In this paper, we consider sums of independent and bounded r.v.s $(\xi_i)_{i=1,\ldots,n}$. Denote 
$\sigma_i^2 = \mathbf{E}\xi_i^2$ and $\sigma^2 = \sum_{i=1}^n \sigma_i^2$. Then $\sigma = \sqrt{n}\,\sigma_1$ in the independent and identically 
distributed (i.i.d.) case. The celebrated Bennett inequality [2] states that: if $\xi_i \le 1$ for all $i$, 
then, for all $x > 0$, 

\[ \mathbf{P}(X_n > x\sigma) \le B(x,\sigma) := \Big(\frac{x+\sigma}{\sigma}\Big)^{-\sigma x-\sigma^2} e^{x\sigma}. \tag{1} \]

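However, Bennett's inequality (1) is not optimal. One of the most well-known improvements of 
(1) is Hoeffding's inequality (cf. (2.8) of [20]), which states that: if $\xi_i \le 1$ for all $i$, then, for all 
$x > 0$, 

\[ \mathbf{P}(X_n > x\sigma) \le H_n(x,\sigma) \tag{2} \]
\[ := \Big\{ \Big(\frac{\sigma}{x+\sigma}\Big)^{x\sigma+\sigma^2} \Big(\frac{n}{n-x\sigma}\Big)^{n-x\sigma} \Big\}^{\frac{n}{n+\sigma^2}} \mathbf{1}_{\{x \le n/\sigma\}} \tag{3} \]
\[ \le B(x,\sigma), \tag{4} \]

where (and hereafter) by convention $\big(\frac{n}{n-x\sigma}\big)^{n-x\sigma} = 1$ when $x = n/\sigma$. By considering the following 
distribution 

\[ \mathbf{P}(\eta_i = 1) = \frac{\sigma^2/n}{1+\sigma^2/n} \quad \text{and} \quad \mathbf{P}(\eta_i = -\sigma^2/n) = \frac{1}{1+\sigma^2/n}, \tag{5} \]

Hoeffding showed that (2) is the best that can be obtained from the exponential Markov 
inequality 

\[ \mathbf{P}(X_n > x\sigma) \le \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}, \quad x \ge 0, \]

due to the elementary equality $\inf_{\lambda \ge 0} \mathbf{E}\exp\{\lambda(\sum_{i=1}^n \eta_i - x\sigma)\} = H_n(x,\sigma)$ for all $0 \le x \le n/\sigma$. 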
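
The relations between (1), (3) and the two-point law (5) can be checked numerically. The following is a minimal sketch, not part of the original paper: the function names are illustrative, and the closed-form minimiser used in `chernoff_two_point` is our own derivation from the first-order condition for the law (5), so it should be read as an assumption that the assertions then test.

```python
import math

def bennett_bound(x, sigma):
    """B(x, sigma) from (1), evaluated in log space."""
    log_b = x * sigma - (sigma * x + sigma**2) * math.log((x + sigma) / sigma)
    return math.exp(log_b)

def hoeffding_bound(x, sigma, n):
    """H_n(x, sigma) from (3); it vanishes beyond x = n / sigma."""
    if x > n / sigma:
        return 0.0
    rest = n - x * sigma
    log_h = (x * sigma + sigma**2) * math.log(sigma / (x + sigma))
    if rest > 0:  # convention: the second factor equals 1 when x = n / sigma
        log_h += rest * math.log(n / rest)
    return math.exp(log_h * n / (n + sigma**2))

def chernoff_two_point(x, sigma, n):
    """inf over lam >= 0 of E exp(lam * (sum eta_i - x * sigma)) under the law (5)."""
    v = sigma**2 / n      # variance of each eta_i
    tau = x * sigma / n   # per-summand threshold, assumed < 1
    lam = math.log((v + tau) / (v * (1.0 - tau))) / (1.0 + v)  # stationary point
    mgf = (v * math.exp(lam) + math.exp(-lam * v)) / (1.0 + v)
    return math.exp(n * math.log(mgf) - lam * x * sigma)

n, sigma = 49, 7.0
for x in (0.5, 1.0, 2.0, 4.0):
    print(x, chernoff_two_point(x, sigma, n),
          hoeffding_bound(x, sigma, n), bennett_bound(x, sigma))
```

For each printed row the first two numbers should agree to rounding error, and both should lie below the Bennett value.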

However, Hoeffding's inequality (2) can still be improved. For sums of bounded random 
variables, Talagrand [37] obtained the following improvement of (2). Assume that $(\xi_i)_{i=1,\ldots,n}$ 
satisfy $-B \le \xi_i \le 1$ for some constant $B \ge 1$ and all $i$. Talagrand showed that, for all 
$0 \le x \le c_1 \frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \Big(\Theta(x) + c_2\frac{B}{\sigma}\Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} \tag{6} \]
\[ \le \Big(\Theta(x) + c_2\frac{B}{\sigma}\Big) H_n(x,\sigma), \tag{7} \]

where $c_1, c_2 > 0$ are two absolute constants, 

\[ \Theta(x) = (1 - \Phi(x)) \exp\Big\{\frac{x^2}{2}\Big\} \]

is a decreasing function in $x \ge 0$ and $\Phi(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{x} e^{-t^2/2}\,dt$ is the standard normal distribution 
function. Since 

\[ \frac{1}{\sqrt{2\pi}\,(1+x)} \le \Theta(x) \le \frac{1}{\sqrt{\pi}\,(1+x)}, \quad x \ge 0, \tag{8} \]

(see Feller [15]), Talagrand's inequality (7) improves Hoeffding's inequality (2) by adding a 
factor of order $O(1/x)$ in the range $0 \le x \le c_1\frac{\sigma}{B}$. 
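
As an illustration of (8), the function $\Theta$ can be evaluated with the complementary error function. This is a small sketch using only the Python standard library; the names `theta` and `theta_bounds` are ours, and the direct product below is only suitable for moderate $x$ (for very large $x$ the two factors under- and overflow).

```python
import math

def theta(x):
    """Theta(x) = (1 - Phi(x)) * exp(x^2 / 2), with 1 - Phi(x) = erfc(x / sqrt(2)) / 2."""
    return 0.5 * math.erfc(x / math.sqrt(2.0)) * math.exp(0.5 * x * x)

def theta_bounds(x):
    """Lower and upper bounds from (8)."""
    lower = 1.0 / (math.sqrt(2.0 * math.pi) * (1.0 + x))
    upper = 1.0 / (math.sqrt(math.pi) * (1.0 + x))
    return lower, upper

for x in (0.0, 0.5, 1.0, 2.0, 5.0, 10.0):
    lo, hi = theta_bounds(x)
    print(f"x={x:5.1f}  lower={lo:.4f}  Theta={theta(x):.4f}  upper={hi:.4f}")
```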

Since the inequalities of Talagrand are valid in the range $0 \le x \le c_1\frac{\sigma}{B}$, it is important 
to know how large the constant $c_1$ can be for practical use. For the sake of simplicity, 
consider the Rademacher case. Denote by $S_n = \varepsilon_1 + \ldots + \varepsilon_n$ the sum of independent r.v.s $\varepsilon_i$ with 
$\mathbf{P}(\varepsilon_i = \pm 1) = \frac12$ for all $i = 1, \ldots, n$. Thus, we take $B = 1$ and $\sigma = \sqrt{n}$. The missing factor 

\[ \Theta_n(x) = \frac{\mathbf{P}(S_n > x\sqrt{n})}{H_n(x,\sqrt{n})} = \frac{\mathbf{P}(S_n > x\sqrt{n})}{\inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(S_n - x\sqrt{n})}} \tag{9} \]

of Hoeffding's inequality (2) and $\Theta(x)$ are displayed in Figure 1. From Figure 1, we find that 
the factor $\Theta(x) + c_2\frac{B}{\sigma}$ in Talagrand's inequality can dominate the missing factor $\Theta_n(x)$ 
in Hoeffding's inequality for $c_1$ smaller than 1 (for instance $0 \le x \le 5$ in Figure 1). However, 
when $c_1$ increases to 1 (i.e. $x$ increases to 7), the constant $c_2$ in Talagrand's inequality (7) 
increases to a large number. In fact, since $\Theta_n(\sqrt{n}) = 1$, from (7), we have $c_2 \ge \sqrt{n}\,(1 - 
\Theta(\sqrt{n})) \sim \sqrt{n}$ as $c_1 \to 1$. Thus, the constant $c_2$ in Talagrand's inequality (7) depends on $c_1$ 
and $c_1$ is less than 1. 

Fig. 1 We display $\Theta_n(x)$ and $\Theta(x)$ as a function of $x$ (comparison for $n = 49$). 
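
The missing factor in (9) can be computed exactly for Rademacher sums, since $S_n = 2K - n$ with $K$ binomial. The sketch below is illustrative only (the helper names are ours) and simply tabulates $\Theta_n(x)$ next to $\Theta(x)$ for $n = 49$, the case shown in Figure 1.

```python
import math

def theta(x):
    """Theta(x) = (1 - Phi(x)) * exp(x^2 / 2)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0)) * math.exp(0.5 * x * x)

def rademacher_tail(x, n):
    """Exact P(S_n > x * sqrt(n)) for a sum of n independent Rademacher variables."""
    s = x * math.sqrt(n)
    # S_n = 2K - n with K ~ Binomial(n, 1/2), so S_n > s iff K > (n + s) / 2
    k_min = math.floor((n + s) / 2) + 1
    count = sum(math.comb(n, k) for k in range(k_min, n + 1))
    return count / 2**n

def hoeffding_rademacher(x, n):
    """H_n(x, sqrt(n)) from (3) with sigma = sqrt(n)."""
    sigma = math.sqrt(n)
    if x > sigma:
        return 0.0
    rest = n - x * sigma
    log_h = (x * sigma + n) * math.log(sigma / (x + sigma))
    if rest > 0:
        log_h += rest * math.log(n / rest)
    return math.exp(0.5 * log_h)  # exponent n / (n + sigma^2) = 1/2

def missing_factor(x, n):
    """Theta_n(x) from (9)."""
    return rademacher_tail(x, n) / hoeffding_rademacher(x, n)

n = 49
for x in (0.5, 1.0, 2.0, 3.0, 5.0):
    print(f"x={x:.1f}  Theta_n={missing_factor(x, n):.4f}  Theta={theta(x):.4f}")
```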

In this paper, we give a sharp lower bound on tail probabilities and some information 
about $c_1$ and $c_2$ of Talagrand's inequality (6). Our main result, Theorem 1, gives some explicit 
expressions for the constants $c_1$ and $c_2$ under a general assumption that $(\xi_i)_{i=1,\ldots,n}$ have $(2+\delta)$th 
moments. In particular, Corollary 1 proves the following one-term asymptotic expansion (a 
type of one-term asymptotic expansion of Cramér [8] and Bahadur-Rao [1]): if $\xi_i \le 1$ and 
$\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ for some constant $B > 0$ and all $i$, then, for all $0 \le x \le 0.1\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big(\Theta(x) + 16\,\theta\,\frac{B}{\sigma}\Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}, \tag{10} \]

where $|\theta| \le 1$. Since $\theta \ge -1$, we complete Talagrand's inequality (6) by giving a sharp lower 
bound. Taking into account the equality $\inf_{\lambda \ge 0} \mathbf{E}\exp\{\lambda(\sum_{i=1}^n \eta_i - x\sigma)\} = H_n(x,\sigma)$, we prove that 
Talagrand's inequality (7) is attained regardless of $c_2$. Moreover, by (8) and (10), we find that 
the relative error between $\mathbf{P}(X_n > x\sigma)$ and $\Theta(x) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}$ converges to 0 uniformly 
in the range $0 \le x = o(\frac{\sigma}{B})$ as $\frac{B}{\sigma} \to 0$, i.e. 

\[ \frac{\mathbf{P}(X_n > x\sigma)}{\Theta(x) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}} = 1 + o(1). \tag{11} \]

To show the tightness of equality (11), we display the simulation of 

\[ R(x,n) = \frac{\mathbf{P}(S_n > x\sqrt{n})}{\Theta(x) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(S_n - x\sqrt{n})}} = \frac{\mathbf{P}(S_n > x\sqrt{n})}{\Theta(x) H_n(x,\sqrt{n})} \]

in Figure 2, which shows that $R(x,n)$ is very close to 1 for large $n$. 

Fig. 2 The tail probabilities and ratios $R(x,n)$ are displayed as a function of $x$ for various $n$ (panels: approximation for $n = 100$ and for $n = 1000$). 

In the proofs, we make use of the conjugate distribution technique, which has become 
standard for obtaining Cramér large deviation results. Such results 
can be found in Cramér [8] and Petrov [31]. We refine the result of Talagrand [37] using a 
method inspired by Grama and Haeusler [16]. 

The paper is organized as follows. In Section 2 we present our main results. In Section 3 
we present some auxiliary results. In Sections 4-6 we prove the main results. 

Throughout the paper, write $a \wedge b = \min\{a, b\}$, $a \vee b = \max\{a, b\}$ and $a^+ = a \vee 0$. Denote 
by $\theta$ a number satisfying $|\theta| \le 1$ and by $\mathcal{N}(0,1)$ the standard normal random variable. We also 
agree that $0^{-1} = \infty$. 

2 Main Results 

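Let $(\xi_i)_{i=1,\ldots,n}$ be a sequence of independent real random variables with $\mathbf{E}\xi_i = 0$. In the sequel, 
we shall use the following condition. 

(A) There exist two constants $\delta \in (0,1]$ and $B > 0$ such that, for all $\lambda \ge 0$, 
\[ \mathbf{E}|\xi_i|^{2+\delta} \le B^{2+\delta} \quad \text{and} \quad \sum_{i=1}^n \mathbf{E}\xi_i^2 e^{\lambda\xi_i} \ge (1 - B\lambda) \sum_{i=1}^n \mathbf{E}\xi_i^2. \]

Notice that condition (A) is satisfied for $\delta = 1$ if $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ holds. Indeed, since 
$(\mathbf{E}\xi_i^2)^{3/2} \le \mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$, we have $\mathbf{E}\xi_i^2 \le B^2$ and $\mathbf{E}|\xi_i|^3 \le B^3$. By the inequality $e^x \ge 1 + x$, 
it follows that $\sum_{i=1}^n \mathbf{E}\xi_i^2 e^{\lambda\xi_i} \ge \sum_{i=1}^n \mathbf{E}\xi_i^2 (1 + \lambda\xi_i) \ge (1 - B\lambda) \sum_{i=1}^n \mathbf{E}\xi_i^2$ for all $\lambda \ge 0$. 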
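
The argument above can be illustrated on a concrete law. The sketch below is only an example under stated assumptions: it takes a single summand distributed as in (5) with per-summand variance $v = 2$ (a hypothetical choice giving a negative third moment), sets $B = \mathbf{E}|\xi|^3 / \mathbf{E}\xi^2$, and checks both parts of condition (A) with $\delta = 1$ on a grid of $\lambda$.

```python
import math

def two_point_law(v):
    """Law (5) with per-summand variance v: value 1 w.p. v/(1+v), value -v w.p. 1/(1+v)."""
    return (1.0, -v), (v / (1.0 + v), 1.0 / (1.0 + v))

def tilted_second_moment(values, probs, lam):
    """E[xi^2 * exp(lam * xi)]."""
    return sum(p * a * a * math.exp(lam * a) for a, p in zip(values, probs))

def check_condition_a(values, probs, lams, tol=1e-12):
    """Return (B, ok) with B = E|xi|^3 / E xi^2 and ok True if (A) holds on the grid."""
    second = sum(p * a * a for a, p in zip(values, probs))
    third_abs = sum(p * abs(a) ** 3 for a, p in zip(values, probs))
    b_const = third_abs / second
    ok_moment = third_abs <= b_const**3 + tol
    ok_tilt = all(
        tilted_second_moment(values, probs, lam) >= (1.0 - b_const * lam) * second - tol
        for lam in lams
    )
    return b_const, ok_moment and ok_tilt

values, probs = two_point_law(2.0)
grid = [k / 20.0 for k in range(0, 41)]  # lambda from 0 to 2
print(check_condition_a(values, probs, grid))
```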

For $\delta \in (0,1]$, denote by $C_{2+\delta}$ the Lyapunov constant defined as follows. Suppose that 
$(\xi_i)_{i=1,\ldots,n}$ have $(2+\delta)$th moments, i.e. $\mathbf{E}|\xi_i|^{2+\delta} < \infty$ for all $i$. Then $C_{2+\delta}$ is the minimum of all 
absolute constants $C$ such that 

\[ \sup_{x \in \mathbf{R}} \big| \mathbf{P}(X_n \le x\sigma) - \Phi(x) \big| \le C\, \frac{\sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}}{\sigma^{2+\delta}} \]

holds for all $(\xi_i)_{i=1,\ldots,n}$. It is known that $0.4097 \le C_3 \le 0.56$ and that $C_3 \le 0.4784$ in the 
identically distributed case (see [33]). For the binomial distribution (for $0 < p \le 0.5$), Nagaev 
and Chebotarev [26] have recently proved that $C_3 \le 0.4215$. 
Our first main result is the following theorem. 

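Theorem 1 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le 1$ and condition (A) for all $i$. Then, for all 
$0 \le x \le 0.25\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) = \big(\Theta(x) + \theta\,\varepsilon_x\big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} \tag{12} \]
\[ \le \big[\big(\Theta(x) + \varepsilon_x\big) \wedge 1\big] H_n(x,\sigma), \tag{13} \]

where 

\[ \varepsilon_x = \frac{e^{t}}{1-2t} \Big( \frac{1.58\,B}{\sqrt{\pi}\,\sigma} + \frac{2^{3+\delta} C_{2+\delta} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}}{(1-2t)^{\delta/2}\,\sigma^{2+\delta}} \Big) \quad \text{with} \quad t = \frac{2xB/\sigma}{1+\sqrt{1-4xB/\sigma}}. \]

Notice that $\inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}$ is sometimes written in the form $\exp\{-n\Lambda_n^*(\frac{x\sigma}{n})\}$, where 
$\Lambda_n^*(x) = \sup_{\lambda \ge 0}\{\lambda x - \frac{1}{n}\log \mathbf{E} e^{\lambda X_n}\}$ is the Fenchel-Legendre transform of the cumulant function 
of $X_n$. When $(\xi_i)_{i=1,\ldots,n}$ are i.i.d., we call $\Lambda^*(x) = \Lambda_n^*(x)$ the good rate function of LDP theory 
(see Deuschel and Stroock [11] or Dembo and Zeitouni [10]). 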

For random variables $\xi_i$ without moments of order larger than 2, some improvements of Hoeffding's 
inequality (2) can be found, for instance, in Eaton [13], Bentkus [3, 4] and Pinelis [35]. In these 
papers, some larger classes of functions are considered instead of the class of exponential functions 
usually used in Markov's inequality. In particular, when the $\xi_i$ are bounded, $\xi_i \le 1$, Bentkus [3] 
showed that 

\[ \mathbf{P}(X_n > x\sigma) \le \frac{e^2}{2}\, \mathbf{P}^{\circ}\Big(\sum_{i=1}^n \eta_i > x\sigma\Big), \tag{14} \]

where $\eta_i$ are i.i.d. with distribution (5), $\mathbf{P}^{\circ}(\sum_{i=1}^n \eta_i > x)$ is the log-concave hull of $\mathbf{P}(\sum_{i=1}^n \eta_i > x)$ 
and $\mathbf{P}^{\circ} \ge \mathbf{P}$. As $\eta_i \le 1$, inequality (14) is sharp up to an absolute constant. Here we give 
an equivalent result to (14). Applying (13) to (14) with $B = \max\{1, \frac{\sigma^2}{n}\}$, we find that if $\xi_i \le 1$, 
then, for all $0 \le x \le 0.24 \min\{\frac{n}{\sigma}, \sigma\}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \frac{e^2}{2}\, \mathbf{P}^{\circ}\Big(\sum_{i=1}^n \eta_i > x\sigma\Big) \le \frac{e^2}{2} \Big( \Theta(x) + O(1) \max\Big\{\frac{1}{\sigma}, \frac{\sigma}{n}\Big\} \Big) H_n(x,\sigma). \tag{15} \]

This equivalent result (15) shows that (14) improves Hoeffding's bound $H_n(x,\sigma)$ by adding a 
factor $F_n(x,\sigma) := \frac{e^2}{2}\big(\Theta(x) + O(1)\max\{\frac{1}{\sigma}, \frac{\sigma}{n}\}\big)$ in the range $0 \le x \le 0.24 \min\{\frac{n}{\sigma}, \sigma\}$. Moreover, 
in the i.i.d. case, we find that bound (14) converges to $\frac{e^2}{2}\Theta(x) H_n(x,\sigma)$ for $0 \le x = o(n^{1/2})$ as 
$n \to \infty$, which is worse than $\Theta(x) H_n(x,\sigma)$ related to bound (13). 

Using Theorem 1 and $C_3 \le 0.56$, we easily obtain the following corollary. 

Corollary 1 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le 1$ and $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ for some constant 
$B > 0$ and all $i$. Then, for all $0 \le x \le 0.1\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big(\Theta(x) + 16\,\theta\,\frac{B}{\sigma}\Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} \tag{16} \]
\[ \le \Big[\Big(\Theta(x) + 16\,\frac{B}{\sigma}\Big) \wedge 1\Big] H_n(x,\sigma). \tag{17} \]

Note that the less-than sign in Talagrand's inequality (6) is replaced by the equal sign in 
(16). Thus equality (16) completes Talagrand's inequality (6) by giving a lower bound. Since 
$\inf_{\lambda \ge 0} \mathbf{E}\exp\{\lambda(\sum_{i=1}^n \eta_i - x\sigma)\} = H_n(x,\sigma)$, by (16), we find that (17) attains equality under 
the law (5). Hence Talagrand's inequality (7) is sharp regardless of the constant 16. 
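
The constant 16 in Corollary 1 can be recovered by direct arithmetic. The sketch below assumes the expression for $\varepsilon_x$ in Theorem 1 with $\delta = 1$, namely $\varepsilon_x = \frac{e^t}{1-2t}\big(\frac{1.58B}{\sqrt{\pi}\sigma} + \frac{16 C_3 \sum_i \mathbf{E}|\xi_i|^3}{(1-2t)^{1/2}\sigma^3}\big)$, together with $\sum_i \mathbf{E}|\xi_i|^3 \le B\sigma^2$ and $C_3 \le 0.56$; the function names are ours.

```python
import math

C3 = 0.56  # upper bound on the Lyapunov constant quoted in the text

def t_of(u):
    """t = 2u / (1 + sqrt(1 - 4u)), where u = x * B / sigma lies in [0, 0.25]."""
    return 2.0 * u / (1.0 + math.sqrt(1.0 - 4.0 * u))

def eps_factor(u):
    """Upper bound on eps_x / (B / sigma) for delta = 1 and sum E|xi_i|^3 <= B * sigma^2."""
    t = t_of(u)
    first = 1.58 / math.sqrt(math.pi)
    second = 16.0 * C3 / math.sqrt(1.0 - 2.0 * t)
    return math.exp(t) / (1.0 - 2.0 * t) * (first + second)

for u in (0.0, 0.05, 0.1):
    print(f"x*B/sigma={u:.2f}  t={t_of(u):.4f}  eps_x*sigma/B <= {eps_factor(u):.3f}")
```

The factor increases with $x B / \sigma$, so its value at $x = 0.1\sigma/B$ bounds it on the whole range of Corollary 1.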

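Inequality (17) implies the following Cramér-type large deviations. 

Corollary 2 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le 1$ and $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ for some constant 
$B > 0$ and all $i$. Then, for all $0 \le x \le 0.1\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \big(1 - \Phi(\check{x})\big) \Big[ 1 + 16\sqrt{2\pi}\,(1 + \check{x})\,\frac{B}{\sigma} \Big], \tag{18} \]

where $\check{x}$ is defined through the Bernstein-type bound $H_n(x,\sigma) \le \exp\{-\check{x}^2/2\}$ used in the proof and satisfies 

\[ \check{x} = x \Big( 1 - \frac{x}{6\sigma} + o\Big(\frac{x}{\sigma}\Big) \Big) \quad \text{as} \quad \frac{x}{\sigma} \to 0. \]

The interesting feature of the bound (18) is that it closely recovers the shape of the standard 
normal tail for all $0 \le x = o(\sigma)$ as $\sigma \to \infty$. 

The well-known asymptotic expansions of tail probabilities (see Petrov [31]) show that $\check{x}$ in 
Corollary 2 usually cannot be replaced by $x$. However, in the following sub-Gaussian case, $\check{x}$ 
can be replaced by $x$. 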

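Theorem 2 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le \sigma_i$ and $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ for some constant 
$B > 0$ and all $i$. Then, for all $0 \le x \le 0.25\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \big(1 - \Phi(x)\big) \Big[ 1 + c_x (1 + x) \frac{B}{\sigma} \Big], \tag{19} \]

where 

\[ c_x = \frac{\sqrt{2}\, e^{t+t^2}}{1-t} + \frac{16\sqrt{2\pi}\, C_3\, e^{t+\frac{3}{2}t^2}}{(1-t)^{3/2}} \quad \text{with} \quad t = \frac{2xB/\sigma}{1+\sqrt{1-4xB/\sigma}}. \]

In particular, for all $0 \le x \le 0.1\frac{\sigma}{B}$, we have $c_x \le 32.47$. 

Note that the condition $\xi_i \le \sigma_i$, which replaces the condition $\xi_i \le 1$ in Corollary 2, is satisfied 
for Rademacher random variables (i.e., $\mathbf{P}(\xi_i = \pm 1) = \frac12$). 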

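For two-sided bounded random variables $|\xi_i| \le B$ with $B > 0$, for example Rademacher 
random variables, the following theorem shows that the constants in Theorem 1 can be 
further improved to smaller ones. Without loss of generality, we take $B = 1$; otherwise we 
consider $\xi_i / B$. 

Theorem 3 Assume that $|\xi_i| \le 1$ for all $i$. Then, for all $0 \le x < 0.606\,\sigma$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big(\Theta(x) + \theta\,\frac{c_x}{\sigma}\Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}, \tag{20} \]

where 

\[ c_x = \frac{2.24\, e^{t^2/2}}{\sqrt{1-t}} + \frac{e^{t+t^2}}{\sqrt{\pi}\,(1-t)} \quad \text{with} \quad t = \frac{x}{\sigma} \exp\Big\{\frac{e\,x^2}{2\sigma^2}\Big\}. \]

In particular, for all $x \ge 0$, 

\[ \mathbf{P}(X_n > x\sigma) \le \Big[\Big(\Theta(x) + \frac{c_x}{\sigma}\Big) \wedge 1\Big] H_n(x,\sigma). \tag{21} \]

Moreover, if $0 \le x \le 0.1\,\sigma$, then $c_x \le 3.08$. 

It is clear that (21) improves the Hoeffding bound $H_n(x,\sigma)$ by adding a missing factor 
$(\Theta(x) + c_x/\sigma) \wedge 1$. 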
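
The numerical constants in Theorem 3 can be checked by a short computation. The sketch assumes the reading $c_x = \frac{2.24 e^{t^2/2}}{\sqrt{1-t}} + \frac{e^{t+t^2}}{\sqrt{\pi}(1-t)}$ with $t = \frac{x}{\sigma}\exp\{\frac{e x^2}{2\sigma^2}\}$; under this reading $t$ reaches 1 exactly at $x/\sigma = e^{-1/2} \approx 0.6065$, which is consistent with the range $x < 0.606\sigma$. The function names are illustrative.

```python
import math

def t_of(r):
    """t = r * exp(e * r^2 / 2), where r = x / sigma."""
    return r * math.exp(math.e * r * r / 2.0)

def c_x(r):
    """Constant c_x of Theorem 3 as a function of r = x / sigma (requires t < 1)."""
    t = t_of(r)
    if t >= 1.0:
        raise ValueError("x / sigma is outside the admissible range")
    clt_term = 2.24 * math.exp(t * t / 2.0) / math.sqrt(1.0 - t)
    theta_term = math.exp(t + t * t) / (math.sqrt(math.pi) * (1.0 - t))
    return clt_term + theta_term

for r in (0.0, 0.05, 0.1, 0.3):
    print(f"x/sigma={r:.2f}  t={t_of(r):.4f}  c_x={c_x(r):.4f}")
```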



3 Auxiliary Results 

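Assume that $\xi_i \le B$ for some constant $B > 0$ and all $i$. We consider the positive random 
variable 

\[ Z_n(\lambda) = \prod_{i=1}^n \frac{e^{\lambda\xi_i}}{\mathbf{E} e^{\lambda\xi_i}}, \quad \lambda \ge 0, \]

so that $\mathbf{E} Z_n(\lambda) = 1$ (the Esscher transformation). Introduce the conjugate probability measure 
$\mathbf{P}_\lambda$ defined by 

\[ d\mathbf{P}_\lambda = Z_n(\lambda)\, d\mathbf{P}. \tag{22} \]

Denote by $\mathbf{E}_\lambda$ the expectation with respect to $\mathbf{P}_\lambda$. Setting 

\[ b_i(\lambda) = \mathbf{E}_\lambda \xi_i = \frac{\mathbf{E}\xi_i e^{\lambda\xi_i}}{\mathbf{E} e^{\lambda\xi_i}}, \quad i = 1, \ldots, n, \]

and 

\[ \eta_i(\lambda) = \xi_i - b_i(\lambda), \quad i = 1, \ldots, n, \]

we obtain the semimartingale decomposition: 

\[ X_k = B_k(\lambda) + Y_k(\lambda), \quad k = 1, \ldots, n, \tag{23} \]

where 

\[ B_k(\lambda) = \sum_{i=1}^k b_i(\lambda) \quad \text{and} \quad Y_k(\lambda) = \sum_{i=1}^k \eta_i(\lambda). \]

In the proofs of Theorems 1-3, we shall need a two-sided bound of $B_n(\lambda)$. To this end, we need 
some technical lemmas. 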
For a random variable bounded from above, the following inequality is well-known. 
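Lemma 1 Assume that $\mathbf{E}\xi_i = 0$ and $\xi_i \le 1$. Then, for all $\lambda \ge 0$, 

\[ \mathbf{E} e^{\lambda\xi_i} \le Be(\lambda, \sigma_i^2) := \frac{\sigma_i^2}{1+\sigma_i^2} \exp\{\lambda\} + \frac{1}{1+\sigma_i^2} \exp\{-\lambda\sigma_i^2\}. \]

A proof of this inequality can be found in Bennett [2]. The last inequality is sharp and 
attains equality when 

\[ \mathbf{P}(\xi_i = 1) = \frac{\sigma_i^2}{1+\sigma_i^2} \quad \text{and} \quad \mathbf{P}(\xi_i = -\sigma_i^2) = \frac{1}{1+\sigma_i^2}. \]

By Lemma 1, we easily obtain the following estimate of the moment generating function 
$\mathbf{E} e^{\lambda\xi_i}$. 

Lemma 2 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le B$ and $\mathbf{E}|\xi_i|^{2+\delta} \le B^{2+\delta}$ for some constants 
$B, \delta > 0$ and all $i$. Then $\sigma_i^2 \le B^2$ and, for all $\lambda \ge 0$, 

\[ \mathbf{E} e^{\lambda\xi_i} \le e^{B^2\lambda^2/2}. \tag{24} \]

Proof. Using Jensen's inequality and $\mathbf{E}|\xi_i|^{2+\delta} \le B^{2+\delta}$, we deduce 

\[ \sigma_i^2 = \mathbf{E}\xi_i^2 \le \big(\mathbf{E}|\xi_i|^{2+\delta}\big)^{2/(2+\delta)} \le B^2. \]

Since $\frac{\partial}{\partial t} Be(\lambda, t) \ge 0$ for all $\lambda, t \ge 0$, the function $Be(\lambda, t)$ is increasing in $t \ge 0$ for all $\lambda \ge 0$. 
Hence, by Lemma 1 and $\xi_i / B \le 1$, for all $\lambda \ge 0$, 

\[ \mathbf{E} e^{\lambda\xi_i} = \mathbf{E} e^{\lambda B \frac{\xi_i}{B}} \le \frac{e^{-B\lambda} + e^{B\lambda}}{2} \le e^{B^2\lambda^2/2}. \tag{25} \]

This completes the proof of Lemma 2. □ 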
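
Lemmas 1 and 2 can be illustrated on a small discrete law. The sketch below uses a hypothetical three-point distribution (values $-0.5, 0, 1$ with probabilities $0.5, 0.25, 0.25$, chosen only so that the mean is 0 and the values are at most 1, hence $B = 1$) and compares its moment generating function with the bound $Be(\lambda, \sigma_i^2)$ of Lemma 1 and with $e^{\lambda^2/2}$ from (24).

```python
import math

# Hypothetical three-point law with mean 0 and values <= 1 (illustration only)
VALUES = (-0.5, 0.0, 1.0)
PROBS = (0.5, 0.25, 0.25)
VAR = sum(p * a * a for a, p in zip(VALUES, PROBS))  # second moment = variance

def mgf(lam):
    """E exp(lam * xi) for the three-point law."""
    return sum(p * math.exp(lam * a) for a, p in zip(VALUES, PROBS))

def bennett_mgf_bound(lam, var):
    """Be(lam, var) from Lemma 1."""
    return var / (1.0 + var) * math.exp(lam) + math.exp(-lam * var) / (1.0 + var)

for lam in (0.0, 0.5, 1.0, 2.0, 4.0):
    print(f"lam={lam:.1f}  mgf={mgf(lam):.4f}  "
          f"Be={bennett_mgf_bound(lam, VAR):.4f}  gauss={math.exp(lam * lam / 2.0):.4f}")
```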
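In the following lemma, we give a two-sided bound for $B_n(\lambda)$. 

Lemma 3 Assume that $\xi_i \le B$ for some constant $B > 0$ and all $i$. Then, for all $\lambda \ge 0$, 

\[ B_n(\lambda) \le \frac{e^{B\lambda} - 1}{B}\, \sigma^2. \]

If $(\xi_i)_{i=1,\ldots,n}$ satisfy condition (A) for all $i$, then, for all $\lambda \ge 0$, 

\[ B_n(\lambda) \ge \Big(1 - \frac{B\lambda}{2}\Big) \lambda\, e^{-B^2\lambda^2/2}\, \sigma^2. \]

Proof. By Jensen's inequality, we have, for all $\lambda \ge 0$, $\mathbf{E} e^{\lambda\xi_i} \ge e^{\lambda \mathbf{E}\xi_i} = 1$. Since $\mathbf{E}\xi_i e^{\lambda\xi_i} = 
\mathbf{E}\xi_i(e^{\lambda\xi_i} - 1) \ge 0$, $\lambda \ge 0$, we obtain the first assertion: for all $\lambda \ge 0$, 

\[ B_n(\lambda) \le \sum_{i=1}^n \mathbf{E}\xi_i e^{\lambda\xi_i} = \sum_{i=1}^n \int_0^\lambda \mathbf{E}\xi_i^2 e^{t\xi_i}\,dt \le \sum_{i=1}^n \int_0^\lambda e^{Bt}\,dt\; \mathbf{E}\xi_i^2 = \frac{e^{B\lambda} - 1}{B}\,\sigma^2. \]

If $(\xi_i)_{i=1,\ldots,n}$ satisfy condition (A), it follows that, for all $\lambda \ge 0$, 

\[ \sum_{i=1}^n \mathbf{E}\xi_i e^{\lambda\xi_i} = \int_0^\lambda \sum_{i=1}^n \mathbf{E}\xi_i^2 e^{t\xi_i}\,dt \ge \int_0^\lambda (1 - Bt)\,dt\; \sigma^2 = \Big(1 - \frac{B\lambda}{2}\Big)\lambda\,\sigma^2. \tag{26} \]

Using Lemma 2, we get the lower bound of $B_n(\lambda)$: for all $\lambda \ge 0$, 

\[ B_n(\lambda) \ge e^{-B^2\lambda^2/2} \sum_{i=1}^n \mathbf{E}\xi_i e^{\lambda\xi_i} \ge \Big(1 - \frac{B\lambda}{2}\Big)\lambda\, e^{-B^2\lambda^2/2}\,\sigma^2, \]

which completes the proof of Lemma 3. □ 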
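Write the cumulant function 

\[ \Psi_n(\lambda) = \sum_{i=1}^n \log \mathbf{E} e^{\lambda\xi_i}, \quad \lambda \ge 0. \tag{27} \]

We can easily obtain the following upper bound for the function $\Psi_n(\lambda)$. 

Lemma 4 Assume that $\xi_i \le 1$ for all $i$. Then, for all $\lambda \ge 0$, 

\[ \Psi_n(\lambda) \le n \log\Big( \frac{1}{1+\sigma^2/n} \exp\{-\lambda\sigma^2/n\} + \frac{\sigma^2/n}{1+\sigma^2/n} \exp\{\lambda\} \Big). \]

Proof. Since the function 

\[ f(\lambda, t) = \log\Big( \frac{1}{1+t} \exp\{-\lambda t\} + \frac{t}{1+t} \exp\{\lambda\} \Big), \quad \lambda, t \ge 0, \]

has a negative second derivative in $t > 0$ (see Lemma 3 in Hoeffding [20]), for any fixed 
$\lambda \ge 0$, $-f(\lambda, t)$ is convex in $t \ge 0$ and 

\[ f(\lambda, t) \le f(\lambda, 0) + \frac{\partial}{\partial t} f(\lambda, 0)\, t = (e^{\lambda} - 1 - \lambda)\, t, \quad t \ge 0. \tag{28} \]

Therefore, by Lemma 1 and Jensen's inequality, we get, for all $\lambda \ge 0$, 

\[ \Psi_n(\lambda) \le \sum_{i=1}^n f(\lambda, \sigma_i^2) \le n f(\lambda, \sigma^2/n) = n \log\Big( \frac{1}{1+\sigma^2/n} \exp\{-\lambda\sigma^2/n\} + \frac{\sigma^2/n}{1+\sigma^2/n} \exp\{\lambda\} \Big). \]

This completes the proof of Lemma 4. □ 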
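For the random variable $Y_n(\lambda)$, $\lambda \ge 0$, we have a rate of convergence to the standard normal 
law. Before presenting the result, we first introduce the conjugate variance. 
Denote the variance of $Y_n(\lambda)$ by $\overline{\sigma}^2(\lambda) = \mathbf{E}_\lambda Y_n^2(\lambda)$, $\lambda \ge 0$. By the relation between $\mathbf{E}$ and $\mathbf{E}_\lambda$, 
the following identity is obvious: 

\[ \overline{\sigma}^2(\lambda) = \sum_{i=1}^n \Big( \frac{\mathbf{E}\xi_i^2 e^{\lambda\xi_i}}{\mathbf{E} e^{\lambda\xi_i}} - \frac{(\mathbf{E}\xi_i e^{\lambda\xi_i})^2}{(\mathbf{E} e^{\lambda\xi_i})^2} \Big), \quad \lambda \ge 0. \tag{29} \]

The following lemma gives some estimates of $\overline{\sigma}(\lambda)$. 

Lemma 5 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le B$ and condition (A) for all $i$. Then, for all 
$\lambda \ge 0$, 

\[ (1 - 2B\lambda)^+ \sigma^2 \le \overline{\sigma}^2(\lambda) \le e^{B\lambda} \sigma^2. \]

Proof. Since $\mathbf{E} e^{\lambda\xi_i} \ge 1$, $\lambda \ge 0$, and $\xi_i \le B$, we get, for all $\lambda \ge 0$, 

\[ \overline{\sigma}^2(\lambda) \le \sum_{i=1}^n \mathbf{E}\xi_i^2 e^{\lambda\xi_i} \le e^{B\lambda} \sum_{i=1}^n \mathbf{E}\xi_i^2 = e^{B\lambda}\sigma^2. \]

This gives the upper bound of $\overline{\sigma}^2(\lambda)$. For all $\lambda \ge 0$, it is easy to see that 

\[ \mathbf{E}\xi_i e^{\lambda\xi_i} = \int_0^\lambda \mathbf{E}\xi_i^2 e^{t\xi_i}\,dt \le \int_0^\lambda e^{Bt}\,dt\; \mathbf{E}\xi_i^2 = \frac{e^{B\lambda} - 1}{B}\, \mathbf{E}\xi_i^2. \tag{30} \]

Using Lemma 2 and condition (A), we obtain, for all $\lambda \ge 0$, 

\[ \overline{\sigma}^2(\lambda) \ge \sum_{i=1}^n \frac{\mathbf{E}\xi_i^2 e^{\lambda\xi_i} - (\mathbf{E}\xi_i e^{\lambda\xi_i})^2}{(\mathbf{E} e^{\lambda\xi_i})^2} \ge e^{-B^2\lambda^2} \Big( (1 - B\lambda)\sigma^2 - (e^{B\lambda} - 1)^2 B^{-2} \sum_{i=1}^n (\mathbf{E}\xi_i^2)^2 \Big) \ge e^{-B^2\lambda^2} \big( 1 - B\lambda - (e^{B\lambda} - 1)^2 \big) \sigma^2. \]

Noting that $\overline{\sigma}^2(\lambda) \ge 0$, by a simple calculation, we have, for all $\lambda \ge 0$, 

\[ \overline{\sigma}^2(\lambda) \ge (1 - 2B\lambda)^+ \sigma^2, \]

which gives the lower bound of $\overline{\sigma}^2(\lambda)$. □ 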
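
The conjugate (Esscher-tilted) mean and variance, and the bounds of Lemmas 3 and 5, can be illustrated for a single summand. The sketch below is a hedged example: it reuses a hypothetical three-point law (values $-0.5, 0, 1$ with probabilities $0.5, 0.25, 0.25$), for which $\xi \le 1$, $\mathbf{E}|\xi|^3 \le 1$ and $\mathbf{E}\xi^3 > 0$, so that condition (A) holds with $B = 1$ and $\delta = 1$; the helper names are ours.

```python
import math

VALUES = (-0.5, 0.0, 1.0)   # hypothetical law, mean 0, values <= B = 1
PROBS = (0.5, 0.25, 0.25)
B = 1.0
SIGMA2 = sum(p * a * a for a, p in zip(VALUES, PROBS))

def tilted_mean_var(lam):
    """Mean b(lam) and variance of xi under dP_lam = exp(lam * xi) / E exp(lam * xi) dP."""
    weights = [p * math.exp(lam * a) for a, p in zip(VALUES, PROBS)]
    z = sum(weights)
    mean = sum(w * a for a, w in zip(VALUES, weights)) / z
    second = sum(w * a * a for a, w in zip(VALUES, weights)) / z
    return mean, second - mean * mean

def lemma3_bounds(lam):
    """Lower and upper bounds on b(lam) from Lemma 3 (one summand, so sigma^2 = SIGMA2)."""
    lower = (1.0 - B * lam / 2.0) * lam * math.exp(-(B * lam) ** 2 / 2.0) * SIGMA2
    upper = (math.exp(B * lam) - 1.0) / B * SIGMA2
    return lower, upper

def lemma5_bounds(lam):
    """Lower and upper bounds on the tilted variance from Lemma 5."""
    return max(1.0 - 2.0 * B * lam, 0.0) * SIGMA2, math.exp(B * lam) * SIGMA2

for lam in (0.0, 0.1, 0.25, 0.4, 1.0, 3.0):
    mean, var = tilted_mean_var(lam)
    print(f"lam={lam:.2f}  b={mean:.4f} in {lemma3_bounds(lam)}  var={var:.4f} in {lemma5_bounds(lam)}")
```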
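Next, we present the rate of convergence of $Y_n(\lambda)$ to the standard normal law. 

Lemma 6 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le B$ and $\mathbf{E}|\xi_i|^{2+\delta} < \infty$ for some constants $\delta \in 
(0,1]$, $B > 0$ and all $i$. Then, for all $\lambda \ge 0$, 

\[ \sup_{y \in \mathbf{R}} \Big| \mathbf{P}_\lambda\Big( \frac{Y_n(\lambda)}{\overline{\sigma}(\lambda)} \le y \Big) - \Phi(y) \Big| \le \frac{2^{2+\delta} C_{2+\delta}\, e^{B\lambda}}{\overline{\sigma}^{2+\delta}(\lambda)} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}. \]

Proof. Notice that $Y_n(\lambda) = \sum_{i=1}^n \eta_i(\lambda)$ is the sum of independent random variables $\eta_i(\lambda)$ 
and $\mathbf{E}_\lambda \eta_i(\lambda) = 0$. Using the well-known rate of convergence in the central limit theorem (cf. 
e.g. Petrov [31], p. 115), we get, for all $\lambda \ge 0$, 

\[ \sup_{y \in \mathbf{R}} \Big| \mathbf{P}_\lambda\Big( \frac{Y_n(\lambda)}{\overline{\sigma}(\lambda)} \le y \Big) - \Phi(y) \Big| \le \frac{C_{2+\delta}}{\overline{\sigma}^{2+\delta}(\lambda)} \sum_{i=1}^n \mathbf{E}_\lambda |\eta_i(\lambda)|^{2+\delta}. \]

Using the inequality $(a + b)^{1+q} \le 2^q (a^{1+q} + b^{1+q})$ for $a, b, q \ge 0$, we deduce, for all $\lambda \ge 0$, 

\[ \sum_{i=1}^n \mathbf{E}_\lambda |\eta_i(\lambda)|^{2+\delta} \le 2^{1+\delta} \sum_{i=1}^n \big( \mathbf{E}_\lambda |\xi_i|^{2+\delta} + |\mathbf{E}_\lambda \xi_i|^{2+\delta} \big) \le 2^{2+\delta} \sum_{i=1}^n \mathbf{E}_\lambda |\xi_i|^{2+\delta} = 2^{2+\delta} \sum_{i=1}^n \frac{\mathbf{E}|\xi_i|^{2+\delta} e^{\lambda\xi_i}}{\mathbf{E} e^{\lambda\xi_i}} \le 2^{2+\delta} e^{B\lambda} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}. \]

Therefore, we obtain, for all $\lambda \ge 0$, 

\[ \sup_{y \in \mathbf{R}} \Big| \mathbf{P}_\lambda\Big( \frac{Y_n(\lambda)}{\overline{\sigma}(\lambda)} \le y \Big) - \Phi(y) \Big| \le \frac{2^{2+\delta} C_{2+\delta}\, e^{B\lambda}}{\overline{\sigma}^{2+\delta}(\lambda)} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}. \]

This completes the proof of Lemma 6. □ 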
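We are now ready to prove our second main result. 

Theorem 4 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $\xi_i \le B$ and $\mathbf{E}|\xi_i|^{2+\delta} < \infty$ for some constants 
$\delta \in (0,1]$, $B > 0$ and all $i$. Then, for all $x \ge 0$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big( \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) + \theta\,\varepsilon_x \Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}, \tag{31} \]

where 

\[ \varepsilon_x = \frac{2^{3+\delta} C_{2+\delta}\, e^{B\overline{\lambda}}}{\overline{\sigma}^{2+\delta}(\overline{\lambda})} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta} \]

with $\overline{\lambda}$ satisfying $\Psi_n'(\overline{\lambda}) = x\sigma$. 

The previous theorem is closely related to Talagrand's inequality (6) (cf. Theorem 3.3 in [37] 
for details). Here, we would like to highlight some differences between the two results. First, 
the less-than sign in Talagrand's inequality (6) has been replaced by the equal sign in (31). 
Second, Talagrand's inequality (6) holds for $0 \le x = O(\frac{\sigma}{B})$, while Theorem 4 holds for all $x \ge 0$. 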

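Proof of Theorem 4. According to (22), we have the following representation: for all $x, \lambda \ge 0$, 

\[ \mathbf{P}(X_n > x\sigma) = \mathbf{E}_\lambda\big( Z_n(\lambda)^{-1} \mathbf{1}_{\{X_n > x\sigma\}} \big) \]
\[ = \mathbf{E}_\lambda\big( e^{-\lambda X_n + \Psi_n(\lambda)} \mathbf{1}_{\{X_n > x\sigma\}} \big) \]
\[ = \mathbf{E}_\lambda\big( e^{-\lambda x\sigma + \Psi_n(\lambda) - \lambda Y_n(\lambda) - \lambda B_n(\lambda) + \lambda x\sigma} \mathbf{1}_{\{Y_n(\lambda) + B_n(\lambda) - x\sigma > 0\}} \big) \]
\[ = e^{-\lambda x\sigma + \Psi_n(\lambda)}\, \mathbf{E}_\lambda\big( e^{-\lambda[Y_n(\lambda) + B_n(\lambda) - x\sigma]} \mathbf{1}_{\{Y_n(\lambda) + B_n(\lambda) - x\sigma > 0\}} \big). \]

Setting $U_n(\lambda) = \lambda(Y_n(\lambda) + B_n(\lambda) - x\sigma)$, we get 

\[ \mathbf{P}(X_n > x\sigma) = e^{-\lambda x\sigma + \Psi_n(\lambda)} \int_0^\infty e^{-t}\, \mathbf{P}_\lambda(0 < U_n(\lambda) \le t)\,dt. \tag{32} \]

The exponential function $e^{-\lambda x\sigma + \Psi_n(\lambda)}$ in (32) attains its minimum at $\lambda = \overline{\lambda}$, where $\overline{\lambda}$ satisfies 
$\Psi_n'(\overline{\lambda}) = x\sigma$. Then, $B_n(\overline{\lambda}) = \Psi_n'(\overline{\lambda}) = x\sigma$, $U_n(\overline{\lambda}) = \overline{\lambda} Y_n(\overline{\lambda})$ and 

\[ e^{-\overline{\lambda} x\sigma + \Psi_n(\overline{\lambda})} = \inf_{\lambda \ge 0} e^{-\lambda x\sigma + \Psi_n(\lambda)} = \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}. \tag{33} \]

Using Lemma 6, we deduce 

\[ \int_0^\infty e^{-t}\, \mathbf{P}_{\overline{\lambda}}(0 < U_n(\overline{\lambda}) \le t)\,dt \tag{34} \]
\[ = \int_0^\infty e^{-\overline{\lambda} y \overline{\sigma}(\overline{\lambda})}\, \mathbf{P}_{\overline{\lambda}}\big(0 < U_n(\overline{\lambda}) \le \overline{\lambda} y \overline{\sigma}(\overline{\lambda})\big)\, \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\,dy \]
\[ = \int_0^\infty e^{-\overline{\lambda} y \overline{\sigma}(\overline{\lambda})}\, \mathbf{P}\big(0 < \mathcal{N}(0,1) \le y\big)\, \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\,dy + \theta\,\varepsilon_x \]
\[ = \int_0^\infty e^{-\overline{\lambda} y \overline{\sigma}(\overline{\lambda})}\,d\Phi(y) + \theta\,\varepsilon_x \]
\[ = \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) + \theta\,\varepsilon_x, \]

where 

\[ \varepsilon_x = \frac{2^{3+\delta} C_{2+\delta}\, e^{B\overline{\lambda}}}{\overline{\sigma}^{2+\delta}(\overline{\lambda})} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}. \]

Therefore, from (32), for all $x \ge 0$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big( \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) + \theta\,\varepsilon_x \Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}. \]

This completes the proof of Theorem 4. □ 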
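
For Rademacher sums every quantity in Theorem 4 is explicit, which gives a simple sanity check of the representation (31). The sketch below is illustrative and rests on stated assumptions: for $\mathbf{P}(\xi_i = \pm 1) = \frac12$ we use $\Psi_n(\lambda) = n \log\cosh\lambda$, so that $\overline{\lambda} = \operatorname{artanh}(x/\sqrt{n})$ and $\overline{\sigma}^2(\overline{\lambda}) = n - x^2$; we take $B = 1$, $\delta = 1$, and replace $C_3$ by its upper bound 0.56, which can only enlarge $\varepsilon_x$.

```python
import math

C3 = 0.56  # upper bound on the Lyapunov constant quoted in the text

def theta(a):
    """Theta(a) = (1 - Phi(a)) * exp(a^2 / 2)."""
    return 0.5 * math.erfc(a / math.sqrt(2.0)) * math.exp(0.5 * a * a)

def rademacher_tail(x, n):
    """Exact P(S_n > x * sqrt(n)) via the binomial distribution."""
    s = x * math.sqrt(n)
    k_min = math.floor((n + s) / 2) + 1
    return sum(math.comb(n, k) for k in range(k_min, n + 1)) / 2**n

def theorem4_terms(x, n):
    """Return (main term, eps_x, Chernoff bound) for the Rademacher case."""
    sigma = math.sqrt(n)
    lam = math.atanh(x / sigma)        # solves n * tanh(lam) = x * sigma
    sigma_bar = math.sqrt(n - x * x)   # sqrt(n * (1 - tanh(lam)^2))
    chernoff = math.exp(n * math.log(math.cosh(lam)) - lam * x * sigma)
    main = theta(lam * sigma_bar)
    eps = 16.0 * C3 * math.exp(lam) * n / sigma_bar**3  # sum E|xi_i|^3 = n
    return main, eps, chernoff

for n in (100, 1000):
    for x in (0.5, 1.0, 2.0):
        main, eps, chernoff = theorem4_terms(x, n)
        ratio = rademacher_tail(x, n) / chernoff
        print(f"n={n:5d} x={x:.1f}  P/inf={ratio:.4f}  Theta(main)={main:.4f}  eps_x={eps:.4f}")
```

In each row the gap between the exact ratio and the main term should be well inside $\varepsilon_x$, since the error term in (31) is a worst-case bound.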



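4 Proof of Theorem 1 

In the spirit of Talagrand [37], we make use of $\Theta(x)$ to approximate $\Theta(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda}))$ of Theorem 4. 
The following proof of Theorem 1 is a continuation of the proof of Theorem 4. 

Proof of Theorem 1. Using (33) and Lemma 4, we get, for all $x \ge 0$, 

\[ \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} \le \inf_{\lambda \ge 0} \exp\Big\{ -\lambda x\sigma + n \log\Big( \frac{1}{1+\sigma^2/n} \exp\{-\lambda\sigma^2/n\} + \frac{\sigma^2/n}{1+\sigma^2/n} \exp\{\lambda\} \Big) \Big\} = H_n(x,\sigma). \tag{35} \]

Since $|\Theta'(x)| \le \frac{1}{\sqrt{\pi}\,x^2}$, we deduce 

\[ \big| \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) - \Theta(x) \big| \le \frac{1}{\sqrt{\pi}}\, \frac{|x - \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})|}{\overline{\lambda}^2\,\overline{\sigma}^2(\overline{\lambda}) \wedge x^2}. \tag{36} \]

Using Lemma 3 and the inequality $e^{-x} \ge 1 - x$, $x \ge 0$, we have, for all $0 \le \overline{\lambda} \le \frac{1}{B}$, 

\[ (1 - B\overline{\lambda})\,\overline{\lambda}\sigma \le \Big(1 - \frac{B\overline{\lambda}}{2}\Big) \overline{\lambda}\, e^{-B^2\overline{\lambda}^2/2}\,\sigma \le \frac{B_n(\overline{\lambda})}{\sigma} = x \le \frac{e^{B\overline{\lambda}} - 1}{B}\,\sigma. \tag{37} \]

By the estimate of $\overline{\sigma}(\lambda)$ in Lemma 5, it follows that, for all $0 \le \overline{\lambda} < \frac{1}{2B}$, 

\[ |x - \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})| \le \overline{\lambda}\sigma \Big[ \Big( \frac{e^{B\overline{\lambda}} - 1}{B\overline{\lambda}} - \sqrt{1 - 2B\overline{\lambda}} \Big) \vee \Big( e^{B\overline{\lambda}/2} - (1 - B\overline{\lambda}) \Big) \Big] \le 1.58\, e^{B\overline{\lambda}} B\overline{\lambda}^2 \sigma \tag{38} \]

and 

\[ \overline{\lambda}^2\,\overline{\sigma}^2(\overline{\lambda}) \wedge x^2 \ge \big( \overline{\lambda}^2\sigma^2 (1 - 2B\overline{\lambda}) \big) \wedge \big( \overline{\lambda}^2\sigma^2 (1 - B\overline{\lambda})^2 \big) = \overline{\lambda}^2\sigma^2 (1 - 2B\overline{\lambda}). \tag{39} \]

Hence, (36) implies that, for all $0 \le \overline{\lambda} < \frac{1}{2B}$, 

\[ \big| \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) - \Theta(x) \big| \le \frac{1.58\, e^{B\overline{\lambda}}}{\sqrt{\pi}\,(1 - 2B\overline{\lambda})}\, \frac{B}{\sigma}. \tag{40} \]

By (37), it follows that $(1 - B\overline{\lambda})\,\overline{\lambda} \le \frac{x}{\sigma}$ and 

\[ 0 \le B\overline{\lambda} \le t := \frac{2xB/\sigma}{1 + \sqrt{1 - 4xB/\sigma}} \quad \text{for all } 0 \le x\frac{B}{\sigma} \le 0.25. \tag{41} \]

Therefore, by Lemma 5, it is easy to see that 

\[ \frac{2^{3+\delta} C_{2+\delta}\, e^{B\overline{\lambda}}}{\overline{\sigma}^{2+\delta}(\overline{\lambda})} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta} \le \frac{2^{3+\delta} C_{2+\delta}\, e^{t}}{(1 - 2t)^{1+\delta/2}\, \sigma^{2+\delta}} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}. \tag{42} \]

Combining (31) and (40)-(42) together, we have, for all $0 \le x \le 0.25\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) = \big( \Theta(x) + \theta\,\varepsilon_x \big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} \le \big[ (\Theta(x) + \varepsilon_x) \wedge 1 \big] H_n(x,\sigma), \]

where 

\[ \varepsilon_x = \frac{e^{t}}{1 - 2t} \Big( \frac{1.58\,B}{\sqrt{\pi}\,\sigma} + \frac{2^{3+\delta} C_{2+\delta} \sum_{i=1}^n \mathbf{E}|\xi_i|^{2+\delta}}{(1 - 2t)^{\delta/2}\, \sigma^{2+\delta}} \Big). \]

This completes the proof of Theorem 1. □ 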

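Proof of Corollary 2. Since $\Theta(x)$ is decreasing in $x \ge 0$ and $\check{x} \le x$, we deduce $\Theta(x) \le \Theta(\check{x})$. 
Notice that Hoeffding's bound is less than Bernstein's bound, i.e. 

\[ H_n(x,\sigma) \le \exp\Big\{ -\frac{\check{x}^2}{2} \Big\} \]

(cf. Remark 2.1 of [14]). Therefore, from (17), we have, for all $0 \le x \le 0.1\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \Big( \Theta(\check{x}) + 16\,\frac{B}{\sigma} \Big) \exp\Big\{ -\frac{\check{x}^2}{2} \Big\} = 1 - \Phi(\check{x}) + 16\,\frac{B}{\sigma} \exp\Big\{ -\frac{\check{x}^2}{2} \Big\}. \]

Using (8), we obtain, for all $0 \le x \le 0.1\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \big(1 - \Phi(\check{x})\big) \Big[ 1 + 16\sqrt{2\pi}\,(1 + \check{x})\,\frac{B}{\sigma} \Big]. \]

This completes the proof of Corollary 2. □ 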
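5 Proof of Theorem 2 

Under the condition of Theorem 2, we have the following upper bound of $\Psi_n(\lambda)$. 

Lemma 7 Assume that $\xi_i \le \sigma_i$ for all $i$. Then, for all $\lambda \ge 0$, 

\[ \Psi_n(\lambda) \le \frac{\lambda^2\sigma^2}{2}. \]

Proof. Using Lemma 1, we have, for all $t \ge 0$, 

\[ \mathbf{E}\exp\Big\{ t\,\frac{\xi_i}{\sigma_i} \Big\} \le \frac{e^{-t} + e^{t}}{2} \le \exp\Big\{ \frac{t^2}{2} \Big\}. \]

Hence, for all $\lambda \ge 0$, 

\[ \mathbf{E} e^{\lambda\xi_i} \le \exp\Big\{ \frac{\lambda^2\sigma_i^2}{2} \Big\} \quad \text{and} \quad \Psi_n(\lambda) \le \sum_{i=1}^n \frac{\lambda^2\sigma_i^2}{2} = \frac{\lambda^2\sigma^2}{2}. \]

This gives the upper bound of $\Psi_n(\lambda)$. □ 

Under the condition of Theorem 2, we have the following lower bound of $\overline{\sigma}^2(\lambda)$. 

Lemma 8 Assume that $\xi_i \le B$ and $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ for some constant $B > 0$ and all $i$. Then, 
for all $\lambda \ge 0$, 

\[ (1 - B\lambda)^+ e^{-B^2\lambda^2} \sigma^2 \le \overline{\sigma}^2(\lambda). \tag{43} \]

Proof. Denote $f(\lambda) = \mathbf{E}\xi_i^2 e^{\lambda\xi_i}\, \mathbf{E} e^{\lambda\xi_i} - (\mathbf{E}\xi_i e^{\lambda\xi_i})^2$. It is easy to see that 
$f'(0) = \mathbf{E}\xi_i^3$ and $f''(\lambda) = \mathbf{E}\xi_i^4 e^{\lambda\xi_i}\, \mathbf{E} e^{\lambda\xi_i} - (\mathbf{E}\xi_i^2 e^{\lambda\xi_i})^2 \ge 0$. 
Thus, for all $\lambda \ge 0$, 

\[ f(\lambda) \ge f(0) + f'(0)\lambda = \mathbf{E}\xi_i^2 + \lambda\,\mathbf{E}\xi_i^3. \tag{44} \]

Therefore, for all $\lambda \ge 0$, 

\[ \mathbf{E}_\lambda \eta_i^2 = \frac{\mathbf{E}\xi_i^2 e^{\lambda\xi_i}\, \mathbf{E} e^{\lambda\xi_i} - (\mathbf{E}\xi_i e^{\lambda\xi_i})^2}{(\mathbf{E} e^{\lambda\xi_i})^2} \ge \frac{\mathbf{E}\xi_i^2 + \lambda\,\mathbf{E}\xi_i^3}{(\mathbf{E} e^{\lambda\xi_i})^2}. \]

Using $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$, we get, for all $\lambda \ge 0$, 

\[ \mathbf{E}_\lambda \eta_i^2 \ge \frac{\mathbf{E}\xi_i^2 + \lambda\,\mathbf{E}\xi_i^3}{(\mathbf{E} e^{\lambda\xi_i})^2} \ge \frac{(1 - B\lambda)\,\mathbf{E}\xi_i^2}{(\mathbf{E} e^{\lambda\xi_i})^2}. \tag{45} \]

Since $\mathbf{E}|\xi_i|^3 \le B\,\mathbf{E}\xi_i^2$ implies that $\mathbf{E}|\xi_i|^3 \le B^3$, by Lemma 2 and $\overline{\sigma}^2(\lambda) \ge 0$, it follows that 

\[ \overline{\sigma}^2(\lambda) \ge \sum_{i=1}^n \frac{\mathbf{E}\xi_i^2 + \lambda\,\mathbf{E}\xi_i^3}{(\mathbf{E} e^{\lambda\xi_i})^2} \ge (1 - B\lambda)^+ e^{-B^2\lambda^2} \sigma^2. \]

This completes the proof of Lemma 8. □ 

Notice that the condition in Theorem 2 is stronger than that in Theorem 1. We can easily 
prove Theorem 2 by (12) in Theorem 1. However, in order to obtain a constant $c_x$ in Theorem 
2 as small as possible, we make use of Theorem 4 to prove Theorem 2. 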
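Proof of Theorem 2. By Lemma 7, for all $x \ge 0$, we have 

\[ \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} = \inf_{\lambda \ge 0} \exp\{-\lambda x\sigma + \Psi_n(\lambda)\} \le \inf_{\lambda \ge 0} \exp\Big\{ -\lambda x\sigma + \frac{\lambda^2\sigma^2}{2} \Big\} = \exp\Big\{ -\frac{x^2}{2} \Big\}. \tag{46} \]

Since $\sigma_i^3 \le \mathbf{E}|\xi_i|^3 \le B\sigma_i^2$ and $\xi_i \le \sigma_i$, we have $\sigma_i \le B$ and $\xi_i \le B$. Then, by Lemmas 3 and 8 and 
inequality (37), it follows that, for all $0 \le \overline{\lambda} < \frac{1}{B}$, 

\[ |x - \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})| \le \overline{\lambda}\sigma \Big[ \Big( \frac{e^{B\overline{\lambda}} - 1}{B\overline{\lambda}} - \sqrt{1 - B\overline{\lambda}}\; e^{-B^2\overline{\lambda}^2/2} \Big) \vee \Big( e^{B\overline{\lambda}/2} - \Big(1 - \frac{B\overline{\lambda}}{2}\Big) e^{-B^2\overline{\lambda}^2/2} \Big) \Big] \le e^{B\overline{\lambda}} B\overline{\lambda}^2 \sigma \tag{47} \]

and 

\[ \overline{\lambda}^2\,\overline{\sigma}^2(\overline{\lambda}) \wedge x^2 \ge \overline{\lambda}^2\sigma^2 (1 - B\overline{\lambda})\, e^{-B^2\overline{\lambda}^2}. \tag{48} \]

Hence, by (36), we obtain, for all $0 \le \overline{\lambda} < \frac{1}{B}$, 

\[ \big| \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) - \Theta(x) \big| \le \frac{e^{B\overline{\lambda} + B^2\overline{\lambda}^2}}{\sqrt{\pi}\,(1 - B\overline{\lambda})}\, \frac{B}{\sigma}. \]

By Lemma 8, it is easy to see that, for all $0 \le \overline{\lambda} < \frac{1}{B}$, 

\[ \frac{16\, C_3\, e^{B\overline{\lambda}}}{\overline{\sigma}^{3}(\overline{\lambda})} \sum_{i=1}^n \mathbf{E}|\xi_i|^3 \le \frac{16\, C_3\, e^{B\overline{\lambda} + \frac{3}{2}B^2\overline{\lambda}^2}}{(1 - B\overline{\lambda})^{3/2}}\, \frac{B}{\sigma}. \]

Thus, from Theorem 4, it follows that, for all $0 \le x \le 0.25\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \Big( \Theta(x) + c_x \frac{B}{\sigma} \Big) \exp\Big\{ -\frac{x^2}{2} \Big\} = 1 - \Phi(x) + c_x \frac{B}{\sigma} \exp\Big\{ -\frac{x^2}{2} \Big\}, \]

where 

\[ c_x = \frac{1}{1 - t} \Big( \frac{e^{t+t^2}}{\sqrt{\pi}} + \frac{16\, C_3\, e^{t+\frac{3}{2}t^2}}{\sqrt{1 - t}} \Big) \]

with $t$ defined in (41). Since $\Theta(x) \ge \frac{1}{\sqrt{2\pi}\,(1+x)}$, $x \ge 0$, we have, for all $0 \le x \le 0.25\frac{\sigma}{B}$, 

\[ \mathbf{P}(X_n > x\sigma) \le \big(1 - \Phi(x)\big) \Big[ 1 + \sqrt{2\pi}\, c_x (1 + x) \frac{B}{\sigma} \Big]. \tag{49} \]

In particular, if $0 \le x \le 0.1\frac{\sigma}{B}$, using $C_3 \le 0.56$, we have $\sqrt{2\pi}\, c_x \le 32.47$. □ 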
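6 Proof of Theorem 3 

For bounded random variables, the convergence rate of $Y_n(\lambda)$ to the standard normal law in 
Lemma 6 can be improved to the following one. 

Lemma 9 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $|\xi_i| \le 1$ for all $i$. Then, for all $\lambda \ge 0$, 

\[ \sup_{y \in \mathbf{R}} \Big| \mathbf{P}_\lambda\Big( \frac{Y_n(\lambda)}{\overline{\sigma}(\lambda)} \le y \Big) - \Phi(y) \Big| \le \frac{1.12}{\overline{\sigma}(\lambda)}. \]

Proof. Using the well-known rate of convergence in the central limit theorem in Petrov [31], 
p. 115, and $C_3 \le 0.56$, we get, for all $\lambda \ge 0$, 

\[ \sup_{y \in \mathbf{R}} \Big| \mathbf{P}_\lambda\Big( \frac{Y_n(\lambda)}{\overline{\sigma}(\lambda)} \le y \Big) - \Phi(y) \Big| \le \frac{0.56}{\overline{\sigma}^3(\lambda)} \sum_{i=1}^n \mathbf{E}_\lambda |\eta_i|^3. \]

Since $|\eta_i| \le |\xi_i| + \mathbf{E}_\lambda|\xi_i| \le 2$, it follows that $\sum_{i=1}^n \mathbf{E}_\lambda|\eta_i|^3 \le 2\,\overline{\sigma}^2(\lambda)$. Therefore, for all $\lambda \ge 0$, 

\[ \sup_{y \in \mathbf{R}} \Big| \mathbf{P}_\lambda\Big( \frac{Y_n(\lambda)}{\overline{\sigma}(\lambda)} \le y \Big) - \Phi(y) \Big| \le \frac{1.12}{\overline{\sigma}(\lambda)}, \]

which completes the proof of Lemma 9. □ 

Lemma 10 Assume that $(\xi_i)_{i=1,\ldots,n}$ satisfy $|\xi_i| \le 1$ for all $i$. Then, for all $\lambda \ge 0$, 

\[ B_n(\lambda) \ge (1 - e^{-\lambda})\, e^{-\lambda^2/2}\, \sigma^2. \]

Proof. Since $\xi_i \ge -1$, for all $\lambda \ge 0$, 

\[ \sum_{i=1}^n \mathbf{E}\xi_i e^{\lambda\xi_i} = \sum_{i=1}^n \int_0^\lambda \mathbf{E}\xi_i^2 e^{t\xi_i}\,dt \ge \int_0^\lambda e^{-t}\,dt \sum_{i=1}^n \mathbf{E}\xi_i^2 = (1 - e^{-\lambda})\,\sigma^2. \tag{50} \]

Using Lemma 2 with $B = 1$, we get the lower bound of $B_n(\lambda)$: for all $\lambda \ge 0$, 

\[ B_n(\lambda) = \sum_{i=1}^n \frac{\mathbf{E}\xi_i e^{\lambda\xi_i}}{\mathbf{E} e^{\lambda\xi_i}} \ge (1 - e^{-\lambda})\, e^{-\lambda^2/2}\, \sigma^2. \]

This completes the proof of Lemma 10. □ 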



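Proof of Theorem 3. The proof is close to the proof of Theorem 1, but here $B = 1$. From 
(34), by Lemma 9, we deduce 

\[ \int_0^\infty e^{-t}\, \mathbf{P}_{\overline{\lambda}}(0 < U_n(\overline{\lambda}) \le t)\,dt = \int_0^\infty e^{-\overline{\lambda} y \overline{\sigma}(\overline{\lambda})}\, \mathbf{P}_{\overline{\lambda}}\big(0 < U_n(\overline{\lambda}) \le \overline{\lambda} y \overline{\sigma}(\overline{\lambda})\big)\, \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\,dy \]
\[ = \int_0^\infty e^{-\overline{\lambda} y \overline{\sigma}(\overline{\lambda})}\, \mathbf{P}\big(0 < \mathcal{N}(0,1) \le y\big)\, \overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\,dy + \theta\,\frac{2.24}{\overline{\sigma}(\overline{\lambda})} \tag{51} \]
\[ = \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) + \theta\,\frac{2.24}{\overline{\sigma}(\overline{\lambda})}, \]

where $\overline{\lambda}$ satisfies $\Psi_n'(\overline{\lambda}) = B_n(\overline{\lambda}) = x\sigma$. Therefore, we obtain the corresponding result of 
Theorem 4 for two-sided bounded random variables: for all $x \ge 0$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big( \Theta\big(\overline{\lambda}\,\overline{\sigma}(\overline{\lambda})\big) + \theta\,\frac{2.24}{\overline{\sigma}(\overline{\lambda})} \Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}. \tag{52} \]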

bmce Ae-Vcr 2 < (1 - e - x ) e 2 a 2 < B n (X) = xa (cf. Lemma [TUt . by Lemma [51 we have, for 
all0< A< 1, 



— x a 2 x I a; 2 e A 1 x (ex 2 . 

A ^^^. eXP (^}^ t=: . eXP {^)- (53) 
and 

A~V(A) A x 2 > A 2 (l - X)e-^a 2 A A~V X V 

= A 2 (1-A>-*V. (54) 
From ([55 )1 and <jl7 j) . it follows that, for all < A < 1, 

- - e A+ ^ 2 1 

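Using Lemma 8 again, we obtain, for all $0 \le \overline{\lambda} < 1$, 

\[ \frac{2.24}{\overline{\sigma}(\overline{\lambda})} \le \frac{2.24\, e^{\overline{\lambda}^2/2}}{\sqrt{1 - \overline{\lambda}}\;\sigma}. \tag{55} \]

Returning to (52), we get, for all $0 \le \overline{\lambda} < 1$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big( \Theta(x) + \theta\,\frac{c_x(\overline{\lambda})}{\sigma} \Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}, \]

where 

\[ c_x(\overline{\lambda}) = \frac{2.24\, e^{\overline{\lambda}^2/2}}{\sqrt{1 - \overline{\lambda}}} + \frac{e^{\overline{\lambda} + \overline{\lambda}^2}}{\sqrt{\pi}\,(1 - \overline{\lambda})}. \]

Noting that $c_x(\overline{\lambda})$ is increasing in $\overline{\lambda} \in [0, 1)$, we have, for all $0 \le x < 0.606\,\sigma$, 

\[ \mathbf{P}(X_n > x\sigma) = \Big( \Theta(x) + \theta\,\frac{c_x}{\sigma} \Big) \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)}, \]

where 

\[ c_x = \frac{2.24\, e^{t^2/2}}{\sqrt{1 - t}} + \frac{e^{t+t^2}}{\sqrt{\pi}\,(1 - t)} \]

with $t$ defined in (53). In particular, using the inequality $\mathbf{P}(X_n > x\sigma) \le \inf_{\lambda \ge 0} \mathbf{E} e^{\lambda(X_n - x\sigma)} \le 
H_n(x,\sigma)$ (cf. (35)), we obtain, for all $x \ge 0$, 

\[ \mathbf{P}(X_n > x\sigma) \le \Big[ \Big( \Theta(x) + \frac{c_x}{\sigma} \Big) \wedge 1 \Big] H_n(x,\sigma). \]

This completes the proof of Theorem 3. □ 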